OPTIMAL BOUNDS ON THE MODULUS OF CONTINUITY OF THE 
UNCENTERED HARDY-LITTLEWOOD MAXIMAL FUNCTION 
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Abstract. We obtain sharp bounds for the modulus of continuity of the uncentered maxi- 
mal function in terms of the modulus of continuity of the given function, via integral formulas. 
Some of the results deduced from these formulas are the following: The best constants for Lip- 
schitz and Holder functions on proper subintervals of R are Lip Q (M/) < (1 + a)^ 1 Lip Q (/), 
a e (0,1]. On R, the best bound for Lipschitz functions is Lip(Af/) < (y/2 — l)Lip(/). In 
higher dimensions, we determine the asymptotic behavior, as d — > oo, of the norm of the 
maximal operator associated to cross-polytopes, euclidean balls and cubes, that is, l v balls 
for p = 1, 2, oo. We do this for arbitrary moduli of continuity. In the specific case of Lipschitz 
and Holder functions, the operator norm of the maximal operator is uniformly bounded by 
2~ Q / ? , where q is the conjugate exponent of p = 1, 2, and as d — ¥ oo the norms approach this 
bound. When p = oo, best constants are the same as when p = 1. 



1. Introduction. 

The constants appearing in the weak and strong type inequalities satisfied by the Hardy- 
Littlewood maximal operator, in its different variants, have been subject to considerable 
scrutin y We mention, for instan c e, [UF] , [ATI], pel] . [Me2] , [GMMj . |GM] . [UK], [BP] , 
[CLM] [StT], [St2] . jSt3] . pW], [Bori2] . [Holla] . [Ha] , pu] . jStSt] . [AT2] . [A13] . |AlPe4j . 
|AlPe5] . |NaTa] . and the references contained therein. Interest lies not only in determining 
sharp inequalities, which in general are hard to come by (in fact, no best constants are known 
for dimensions larger than one) but also in finding out how constants change as certain 
parameters (for instance, the dimension) vary, or when the type of set one is averaging over 
is modified, or the space of functions one is considering is changed. 

Here we study the issue of optimal inequalities satisfied by the uncentered maximal operator 
M, and also its asymptotic behavior, but from a different viewpoint: Instead of considering 
weak and strong type inequalities, we analyze the properties of M in connection with the 
modulus of continuity of a function. The overall emerging pattern reveals that the uncentered 
maximal operator improves regularity, by preserving moduli and reducing constants. But in 
general there is no "qualitative" improvement in the modulus. For instance, it may happen 
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that there is no change in the Holder exponent of a function, not even in a weak, almost 
everywhere sense. More precisely, we will see that there are Holder functions / with exponent 
a G (0, 1), and no better than a on a set of positive measure, such that Mf is also no better 
than Holder (a) on a set of the same measure, cf. Example 14.81 We note that the preservation 
of regularity does not extend to C 1 functions, see Remark 14.71 

This article is part of a wider project, which attempts to find out under which condi- 
tions and to what extent the Hardy-Littlewood maximal operator improves the regularity of 
functions, in different settings. If there is such an improvement, then one can try to prove 
variants of inequalities involving derivatives (for example, Gagliardo-Niremberg-Sobolev type 
inequalities) with DMf replacing Df. In applications one often needs to consider functions 
more general than those belonging to Sobolev spaces, so it is natural to look for this kind of 
inequalities under as little regularity as possible. It is also possible to consider other maximal 
operators, associated to smoother approximations of the identity, but we do not pursue this 
line of research here. 

The study of the Hardy-Littlewood maximal operator acting on spaces that measure 
smoothness was initiated in |Ki] by J. Kinnunen , who proved its boundedness on W 1,p (M. d ) 
for 1 < p < oo, and also on the Lipschitz and Holder classes (without increasing the corre- 
sponding constants); see also |KiLij . |HaOnj . |KiSaj . |Taj . |Luj . In |AlPe] . part of the project 
outlined in the previous paragraph is carried out for functions of bounded variation and 
d — 1 (the situation when d > 1 is still not well understood, cf. |AlPe2] and |AlPe3] ) . Unlike 
the Holder case, here a qualitative gain in regularity does occur (cf. Theorem 2.5): If / is 
of bounded variation, then the uncentered Hardy-Littlewood maximal function Mf is abso- 
lutely continuous (however, the centered maximal function need not even be continuous), and 
furthermore, the variation is not increased by M. As application, a Landau type inequality 
under less regularity is presented in Theorem 5.1 of |AlPe] . 

Given (M. d , || • ||), where || • || is an arbitrary norm in M. d , and / : M d — > R, we present 
optimal integral formulas for the modulus of continuity of Mf in terms of the modulus of 
/. Note that distances appearing in the moduli of continuity and balls defining the maximal 
function, are determined according to || ■ ||. However, we always use the same underlying d 
dimensional Lebesgue measure, regardless of the norm under consideration, i. e., there are 
no different normalizations of the measure for different norms. We take the viewpoint that 
there is (essentially) only one norm in dimension one, the usual absolute value, and this fixes 
Lebesgue measure in every dimension by the requirement that a cube of sidelength one has 
measure one. This also forces us to utilize the standard definition (via the euclidean length) 
and the standard normalizations, when dealing with Hausdorff measures. Such a convention 
(or some analogous consistency condition) is needed, for instance, to use Fubini's Theorem, 
and more generally, the coarea formula. 

A very brief exposition of the contents of this paper follows next. The main integral 
formulas of the paper, valid for an arbitrary norm in the global case, where the domain 
under consideration is the whole space M. d , appear in Theorem 12.71 and Corollary 12.141 These 
formulas are then specialized to the following three norms and their associated maximal 
functions: The norm (cf. Theorem 13. ip . the £2 norm (cf. Theorem 13. 14j) . and the i\ 
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norm (cf. Theorem 13 . 16D . Their associated maximal functions are respectively defined by 
averaging over cubes, euclidean balls, and cross-polytopes. A few consequences of these 
Theorems are the following: The norm of the maximal operator acting on Lipschitz functions 
is II^^Hop(i) = v^2 — 1 for d — 1, cf. Corollary 13.61 For the 4» norm the best constants 
in dimensions 2 and 3 are approximately 0.574 and 0.66155. The exact values appear in 
Corollaries 13.71 and 13.101 For arbitrary d, ||M||o P (i) > (d — l)/(c/+ 1), and the error in 
this estimate is o(l/(d+ 1)), cf. Corollary 13.41 For the i\ norm all constants are exactly 
the same as for the norm, see Theorem 13.171 By way of contrast, we note that in the 
euclidean case and on the class of Lipschitz functions, ||M||o p (i) < in every dimension, 
and this bound is optimal. With respect to the preceding results, some open questions are 
mentioned; we indicate one now: Since HM/H^ = WfW^ and (under the euclidean norm) 
ll-DM/H^ < H-D/Hoo, the maximal operator is a contraction on W 1,oc, (R d ), the Sobolev 
space of essentially bounded functions with essentially bounded derivatives (note that the 
contraction is not strict, consider for instance the constant functions). Is there some analogous 
result for p < oo sufficiently large? 

The last section of this paper deals with the maximal operator on proper subintervals of 
R. In this case, the operator norm of M when acting on the Holder and Lipschitz classes is 
||iW|| op(os) = (1 + a) -1 (cf. Corollary 14. 5p . The local, higher dimensional case is not studied 
here; we only point out that constants when d > 1 depend on the geometry of the domain, 
as was to be expected, cf. Remark 14.91 

Finally, we mention that the centered maximal operator M c associated to euclidean balls 
satisfies ||^ c |lo p ( a ) = 1 i n all dimensions, so Lipschitz and Holder constants are in general not 
reduced (cf. Remark 12 .4p . Thus, the preceding results, together with Theorem 2.5 of |AlPe] 
mentioned above, suggest that from the viewpoint of regularity the uncentered maximal 
operator is a more natural object of study than the centered one. 

2. Definitions and global results for arbitrary norms. 

Definition 2.1. Let U C M. d be an open set, let || • || be a norm on M. d , and let B be a 
generic ball with respect to this norm. Given a locally integrable function / : U — > R, the 
noncentered Hardy-Littlewood maximal function Mf is defined by 

Mf(x) := sup jL [ \f(y)\dy. 

x&BdU |-D| JB 

Here \B\ stands for the Lebesgue measure of B. Regarding the centered maximal function 
M c f(x), one requires that balls be centered at x rather than just containing it, but everything 
else is as in the uncentered case. 

Definition 2.2. The modulus of continuity of a function / is 

u>(f,6) :=sup{|/(z)-/(j/)|:||a:-y||<<f}. 

We point out that both definitions depend on the norm under consideration (while the un- 
derlying Lebesgue measure does not). Although maximal functions and moduli of continuity 
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obtained via different norms will always be pointwise comparable, from the viewpoint of best 
constants distinctions cannot be neglected. Indeed, when the euclidean (£ 2 ) and the max 
(£oo) norms are used in M. d , d > 1 the results vary. Somewhat surprisingly, since already in 
dimension 3 cubes and cross-polytopes are very different geometrical objects, best constants 
associated to the norm and arbitrary moduli of continuity are exactly the same as those 
obtained under the £\ norm. Let us emphasize, though, that our results say nothing, for 
instance, about the maximal function associated to cubes when the length used is the eu- 
clidean distance. For us, fixing a norm fixes the balls we average over; once we select, say, the 
euclidean norm, the maximal function considered in our theorems will be the one associated 
to euclidean balls. 

The following theorem is due to Juha Kinnunen, cf. |Kij . While not explicitly stated there, 
its proof appears in |Kij . pp. 120-121, Remark 2.2 (iii). A small variant of the argument 
yields the boundedness in W l,p (M.), 1 < p < oo, of the maximal operator (Remark 2.2 (i) of 
|Ki] ) ; an abstract version can be found in Theorem 1 of [HaOnJ, where it is applied to the 
spherical maximal operator. 

In the next theorem Ai denotes a generic Hardy-Littlewood maximal operator (it could be 
centered or noncentered, associated to euclidean or to other balls). 

Theorem 2.3. (J. Kinnunen). Let f : IR d — > R be locally integrable. Then, for every 5 > 0, 

(2.3.1) u(Mf,8)<u(\f\,S). 

Proof. Let h,x G R d and set fh(x) := f(x + h). By commutativity of A4 with translations, 
Mf(x + h) = Mfh{x), and by subadditivity, \Mfh — Mf\ < M(\fh\ — \f\)- Since averages 
never exceed a supremum, we have sup xeR d Ai(\fh\ — \f\){%) < su PieM d (l I fh\ ~ \f\\){ x )i an d 
now (12.3. ip follows by taking the sup over < 5. □ 

Remark 2.4. This simple result already contains the sharp bound for the centered maximal 
operator M c on the Lipschitz and Holder classes. Let ip(x) = max{l — |x|,0} be defined 
on R; clearly, M c ip(x) = ip(x) for every x G [—1/2, 1/2], so both functions have the same 
Lipschitz constant: Lip(M c ip) = Lip(^) = 1. Now, this example can be easily adapted 
to higher dimensions (for instance, by making the corresponding function depend only on 
the first coordinate), and it follows that the centered maximal operator on M. d does not in 
general reduce the Lipschitz constant of a function. By way of contrast, we mention that 
the uncentered maximal operator on IR satisfies Lip(M/) < (y/2 — l)Lip(/), cf. Corollary 
13.61 below, while on IR d with the Euclidean norm, the bound Lip(M/) < 2" 1 / 2 Lip(/) holds 
uniformly in the dimension (Theorem 13. 141) . 

As for Holder functions, the preceding example can be easily modified to yield the same 
conclusion. In one dimension, set r](x) := max{l — |x| a ,0} when x < 0, set r](x) := 1 + \x\ a 
on [0, 1/4], and finally, extend r) to [1/4, oo) by reflection about the x = 1/4 axis. Then, by 
concavity of 77 on < x < 1/4, M c r](x) = f](x) for every x G [0, 1/4], and thus both functions 
have the same Holder constant (we mention that the uncentered operator improves Holder 
constants, see formula (13.3. ip below). It is clear that this example can also be adapted to 
higher dimensions. 
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Our aim in this paper is to find best inequalities in the spirit of ( 12.3. ip for the uncentered 
maximal operator, which, as noted in the introduction, has better properties regarding the 
regularization of functions than its centered relative. 

Definition 2.5. A function uj : [0, oo) — > [0, oo) is a modulus of continuity if it is the mod- 
ulus of continuity of a uniformly continuous function, i.e., if there is a uniformly continuous 
function / such that for all 5 > 0, uj (8) = uj (f, 5). 

Remark 2.6. Often a modulus of continuity uj : [0, oo) — > [0, oo) is defined as a continuous, 
nondecreasing, subadditive function (so uj(a + b) < uj(a) + uj(b)), vanishing at zero. Note that 
from continuity and subadditivity it follows that uj is uniformly continuous. It is well known 
and not difficult to check that the modulus of continuity of a uniformly continuous function 
/ has these properties, while given any uj satisfying the above conditions, there is a uniformly 
continuous / such that uj(-) = uj(f,-), namely uj itself. We do not assume that moduli of 
continuity are bounded, so our results apply to general Lipschitz and Holder functions. To 
avoid trivialities, given a generic modulus of continuity, we shall assume it is not identically 
zero. 

The next theorem, and its corollary I2.14[ contain the main integral formulas of the paper, 
valid (in the global case) for all norms and all dimensions. Note that on M. d the value of 
Mf(x) is the same regardless of whether the balls we average over are taken to be open or 
closed. We shall assume whatever is more convenient at any given point. For instance, in 
the next theorem and its proof we suppose that balls are closed. Additionally, we use the 
following notation: a n t b (resp. a n j. b) means that the sequence {a n } converges to b in a 
monotone increasing (resp. decreasing) fashion. 

Theorem 2.7. Let d > 1 and let f be a locally integrable function on M. d . Then, for every 
norm || • || on M. d and every t > 0, 

(2.7.1) uj{Mf,t)< sup inf 1 [ uj (\f\,t\\c + Ru\\) du. 

{ veR d. \\ v \\=l\ {ceR d ,R>0:\\v-c\\<R\ |-D(U, 1J | 7,8(0,1) 

The preceding inequality is optimal in the sense that given a modulus of continuity uj, there 
exists a function ip such that letting f = ip in \2. 1. the following equalities hold: For all 
t > 0, we have uj (ip, t) = uj (t), and furthermore, 

(2.7.2) uj(Mip,t)= sup inf / uj (t ||c + Ru\\) du. 

/ u6R d. ||f|| = l} {cm d ,R>0:\\v-c\\<R} |-d(0, 1)1 Jb(0,1) 



If v is a unit vector, to find its associated infimum in \2. 1. 2\ ) it is enough to consider the set 



(2.7.3) {c G R d , R > : \\v - c|| = R and R < l} . 



As for the extremal functions tp appearing in (2. 7.2 ), they have the following form: If uj 
is bounded, we can take ip(x) = \\u)\\l°°([o,oo)) ~ ^GMD? while if uj is unbounded, we set 
ip(x) = J2^=i(^n — w(\\x — a n \\)) + , where {fi n }i° t 00 an d { a n}i° diverges to infinity so fast, 
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that on the support of each spike the maximal function does not depend on any of the other 
spikes. 

Observe that (12. 7.1ft is stronger, for the uncentered maximal operator, than Kinnunen's 
inequality u (Mf,t) < u(\f\,t), as can be seen by letting R — > (and hence c — > v). Note 
also that the right hand side of (I2.7.2p depends only on the modulus of continuity and on the 
norm. Regarding the extremal functions, our strategy to find them is easy to explain (and the 
proof of the theorem shows that it works): Suppose w is a bounded modulus of continuity, 
and suppose that a given function g has a global maximum at x. Then Mg(x) = g(x), 
and in order to maximize Mg(x) — Mg(y) for each y ^ x, we want to minimize Mg(y). 
Thus g should have the fastest possible decay allowed by u in every direction (and hence the 
least possible mass). This is precisely what if)(x) := |M|l°°([o,oo)) — does. A similar 

observation (in a local sense) can be made for unbounded moduli. Finally, we mention that 
while it is natural to suspect that the centers c in (12.7.31) associated to v should be chosen so 
that B(c,R) C -8(0, 1), Theorem 12.71 does not make any such assertion, and in fact, it may 
be false in general. But we shall see in the next section that it is indeed true for the £±,£2 
and £qo norms. 

Proof. Given / > 0, we assume that the associated modulus of continuity u (f, t) is finite for 
every t > 0, for otherwise there is nothing to prove. Hence, we take / to be locally bounded. 
We may also assume that Mf is not constant, and in particular, that it is not identically 00. 
Next, choose x, y G E and suppose that Mf(y) < Mf(x). Set 

E := {b Gl d ,T> : \\(y - x) - b\\ < T} and F := {a G M d , S > : \\x - a\\ < S}. 

Then, for all (a, S) G F we have E C {b G R d , T > : y G B(a + b, S + T)} by the triangle 
inequality. Hence 

Mf(x) - Mf(y) = sup 1 I f{u)du - Mf(y) 

F \±5{a, b)\ J B (a,S) 

<supinf(— ^— / f(u)du- . / f{u)du 

f e \\B(a,S)\ y B(0jS) \B{a + b,S + T)\ J B ( a +b,s+T) 

<supinff— i— / \f(a + Su)-f(a + b + (S + T)u)\du) 

F E \\B(U, 1)| J B (0,1) J 

< supinf * / u{f,\\b + Tu\\)du 

F E |-D(U, 1)| J B (0,1) 

where the sup F has been deleted from the last line since neither a nor S appear in the integral. 
By symmetry of -8(0, 1), the inequality 

(2.7.4) \ M f(x)-Mf(y)\<mi— i— / oj(f,\\b + Tu\\)du 
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also holds when Mf(y) > Mf(x), and thus it always holds. Writing in ( 12. 7.4ft c\\y — x\\ = b, 
R\\y — x\\ = T, and y — x = \\y — x\\v, where \\v\\ = 1, we have 



\Mf(x)-Mf(y)\< inf , TT^TlW oj(f,\\y - x\\\\c + Ru\\)du. 

{ceR d ,R>0:\\v~c\\<R} 15(0,1)1 J B {0,1) 

Since the right hand side of the last inequality is increasing in \\y — x\\, it follows that for 
every t > 0, 

u(Mf,t)= sup \Mf(x)-Mf(y)\ 

{x,y<=R d :\\x-y\\<t} 



(2.7.5) < sup inf — - [ u(f,t\\c + Ru 

{«6H«»:||«||=1} {ceR d ,R>0:\\v-c\\<R} \B{0, 1)\ J B (0,l) 



u. 



Next we prove that this inequality is sharp. Suppose first that u is a bounded modulus of 
continuity. Set Q := ||w||l°c([o,oo)), an d write i/j(x) := Q — co(\\x\\). Fix t > 0. If x satisfies 
< ||x|| < t, then we can express x = tv, where < \\v\\ < 1, and we get 

u(MiJ),t)>MiJ)(0)-Mil>(x)= inf — / u (t \\c + Ru\\) du. 

{ceK d ,i?>0:||u-c||</?} |-d(0, 1)| Jb(0,1) 

Now the result follows by taking the supremum over all v such that < ||t> || < 1. If u is 
unbounded, we modify ip it as follows: Take a sequence of suitably chosen "spike" functions 
(fi„ — lo (\\x — a n \\)) + , and then set ifj(x) = Yl^Lii^n — w(\\x — a n \\)) + . "Suitably chosen" in 
the preceding sentence means that the heights Q n tend to infinity and the different spikes are 
placed so far apart (i.e., a n — > oo so fast) that on the support of each spike, the others need 
not be taken into account when computing Mip(x) (in particular, different spikes will have 
disjoint supports, so there are no convergence issues with the series defining ip). 

In order to apply formula (12.7.51) . it is useful to narrow down as much as possible where 
the infimum occurs. First we show that it is enough to consider R < 1. Fix a unit vector v 
and suppose R > 1. Since \\v — 0|| = 1 < R, the origin is an admissible center c associated 
to v. But then averaging u)(f, t\\ ■ ||) over -6(0, 1) yields a value no larger than averaging over 
any other ball B(c,R) containing v, for every vector in B(c,R) \ B(0, 1) has norm larger 
than one, hence larger than the norm of any vector in 5(0,1) \ B(c,R), and additionally 
\B(c,R)\B(0,l)\>\B(0,l)\B(c,R)\. 

Next we prove that it is enough to consider pairs (c,R) for which \\v — c\\ = R. Suppose 
|| f — c|| < R. Since the continuous function r i— > rR— \\v— rc\\ changes sign on [0, 1], there exists 
an r G [0, 1] such that \\v — r c\\ = r Q R and, for such an r , we have u(f,t\\r Q c + r Ru\\) < 
u(f, t\\c + -Rii||). Hence, we do no worse by using the pair (r c, r Q R) instead of (c,R), and 
(I2T511 follows. □ 

Remark 2.8. We mention that to obtain (I2.7.3p . instead of the selfcontained argument given 
above, we could have applied Anderson's Theorem (cf., for instance, Theorem 1.11, pg. 376 
of |Gaj . or Theorem 1 of |An] ) . which for nonnegative, integrable, symmetric and unimodal 
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functions /, and origin symmetric convex bodies K C M. d , tells us that 

/ / + cy) dx > / f(x + y) dx, 

J K J K 

where < c < 1 and y G M. d . Since we shall use Anderson's Theorem later on, it is stated 
here for easy reference. 

Definition 2.9. Given a modulus of continuity u, define the Lipschitz space Lip(a;) = 
Lip(u;,X) via the seminorm 

\m-f(y)\ _ u>{f,t) 

lLip( "' X) { , ,ylx% y} u(\\x-y\\) ~ S S$ u>{t) ■ 
Then / G Lxp(cj,X) if (and only if) II/IIlw^ X ) < 00 • I n this section and the next we will 
always have X = M. d , so reference to X shall usually be omitted. Next, set 

(2-9.1) ||M|| 0pM := sup "^" LipH = sup \\Mf\\ Up{oj) . 



ll/llLip(„)#0 \\J\\Up( U ) ll/llLip M =l 

When u (t) = t a and < a < 1, we use Lip Q (X) (or just Lip(a)) to denote the corre- 
sponding spaces of Holder continuous functions and of Lipschitz functions on X, Lip Q (/) 
to denote ||/|| Lip(tlj) , and ||M|| 0p(a) to denote ||M|| 0p(w) . If a = 1, we often omit it, simply 
writing Lip(X) and Lip(/). 

While the notation does not make it explicit, since we are considering functions defined on 
(M. d , || • ||), ||M|| 0p A ) depends both on d and on || • ||. But dimension and norm will always be 
clear from context. 

Remark 2.10. Kinnunen's Theorem (12.31) shows that H-MHop^) — 1 for all sorts of maximal 
operators M., since for every t > and every / with H/H^p^x) — ^ we h ave v(M.f,t) < 
u(\f\,t)<u(f,t)<u(t). 

Example 2.11. Note that for / G Lip(a;), its norm 11/11^^) depends not only on the space 
Lip(a;), but also on the modulus u used to define it (we write norm for short, even though 
we mean seminorm). Consider, for instance, the functions on R that are both bounded and 
Lipschitz. As a set, this space can be defined via many moduli, for example oj(t) := minjt, 1} 
and oj'(t) := min{t, 1/2}. Then if)(x) = (1 — |a;|) + = max{0, 1 — \x\} has norm 1 in Lip(o;) and 
norm 2 in Lip(a/). We also mention that since lim^oo oj (M.ip, t) = 1, we have ||M|| , % = 1, 
so the upper bound 1 can be attained by the uncentered maximal operator. But we shall see 
that for many standard moduli strict inequality holds. 

Remark 2.12. Identifying some extremal functions, as we do in Theorem 12 .7\ allows us to 
immediately improve the general bound 1 in the uncentered case, on Lipschitz functions, for 
every dimension d, and all norms. 

Corollary 2.13. Fix d > 1. Given any norm \\ ■ \\ on IR d ; the associated maximal operator 
acting on Lipschitz functions satisfies ||M|| 0p (i) <d/(d+ 1). 
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Proof. Let ip{x) = max{l — ||x||, 0} (since we are in the Lipschitz case, it is immaterial which 
height Q we select in Theorem 12 .7\ so we just pick Q = 1). Given a vector v with \\v\\ = 1, 
we estimate Mip(v) by integrating over the support of ip, that is, over the unit ball according 
to || • || . This gives a lower bound for Mip on the unit sphere, and hence an upper bound for 
ll^llop(i)- Since for d > 1 the average of the cone over the unit ball is l/(d + 1), we have 
||Af||op ( i)<l-l/(d+l). □ 

Of course, the very general but otherwise rather crude estimates given by the preceding 
corollary cannot be expected to be sharp. In dimension one there is essentially one norm, 
and the best constant is \/2 — 1, as will be seen below, rather than 1/2. We shall show that 
in dimensions two and three the constants 2/3 and 3/4 can be improved when dealing with 
the £\ and ^ norms. 

Let / be a smooth, compactly supported function on R d , and recall that Lip(M/) = 
1 1 DM/ 1 1 oo. Since by the previous corollary ||DM/||oo < d/(d+ l)]]!?/!^, it is natural to 
suspect that if pd is high enough, for every p > Pd one can find a c p G (0, 1) such that 
|| DM f ||p < Cp||D/|| p (with c p independent of /, see also question 2 below). But in this paper 
we study the size of ||£)M/|| P only when p = oo. 

Note also that asymptotically the above corollary does not improve the general upper 
bound 1. Thus, it is natural to enquire whether bounds strictly less than 1 and independent 
of the dimension can be obtained, by a more careful choice of averaging ball. It turns out 
that for cubes (with sides parallel to the axes, that is, ioo balls) and cross-polytopes (£i balls) 
the constant 1 is the correct asymptotic value, and the trivial upper bound d/(d+ 1) from 
the previous result is "essentially" optimal (actually, we shall see later that the lower bound 
(d — 1)/ (d+1) is a better asymptotic estimate). However, substantial improvement is possible 
for Euclidean (£ 2 ) balls, and thus, the same question on the p norm of the derivative arises: 
Can we have c p < 1 for p < oo sufficiently high? And can we take p to be independent of the 
dimension? 

We conclude this section by using the notation from Definition 12.91 to summarize the main 
contents of Theorem 12.71 While formula (I2.14.ip below does not look very promising, due to 
the successive appearance of two suprema and one infimum, the fact is that it will allow us 
to obtain optimal asymptotic estimates for the norms considered in the next section, and in 
some cases we will be able to actually compute the number ||M|| 0p ^. 

Corollary 2.14. Let || • || be a norm on M d ; and let u be a modulus of continuity. Then the 

associated maximal operator M acting on Lip (a;) has norm given by 

(2.14.1) 

H-^llop(w) = SU P SU P r m ^ ~ 77\ I w{t\\c + Ru\ s 



t>o 



{ veR d. ||i,|| = l} {ceR d ,0<R<l:\\v-c\\=R} U(t) 7b(0,1) 1 



Proof. Using one of the extremal functions ip found in Theorem 12.71 we have ||M|| 0p ^ = 
liipftj)) an d now the result follows from (12.7.21) together with ( 12. 7. 3ft . □ 
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3. Bounds in the 4o, £ 2 and i\ cases. 

Next we specialize Theorem 12.71 and Corollary 12.141 to the three norms in the title of this 
section. This specialization will allow us to find explicit constants, at least for low dimensions 
and in the case of cubes (where the needed arguments seem to be simpler). As we mentioned, 
however, all constants will turn out to be exactly the same when working with the and l\ 
norms. 

Theorem 3.1. Let M be the uncentered maximal operator associated to balls defined by the 
loo norm on M. d , i.e., to cubes with sides parallel to the coordinate axes, and letu be a modulus 
of continuity. Then 



(3-1-1) ||M|| 0pH = sup I inf -L, [ U (t \\x\\J 

t>0 I 0<s<lu(t) J[- S> i]d (1 + s 



dx 



or equivalently , 
(3.1.2) 

ll M llo P H= su p{i n t 



0<s<l (1 + s) d u(t) 



u d 1 u(tu)du+ / (it + s) d ~ l u(tu)du 



Moreover, if we choose the same modulus uj for every dimension d, then ||-W|| 0p ^ is nonde- 
creasing in d, and hm ( 2_ 5 . 00 ||M|| aa = 1. 

Of the two expressions for ||-W|| 0p AA given in the preceding theorem, A3. 1 . lj) is the one 
with a clearer geometric content: It says that an optimal choice for v is (1, 1, . . . , 1), and the 
associated minimizing cube contains the origin and is contained in [—1, l] d . On the other 
hand, while harder to interpret, formula (13.1 .2p turns out to be computationally much more 
convenient. 

Proof. From (I2.14.ip it follows that 

I f du 

(3.1.3) ||M|| = sup sup inf 7777V / u (* H c + ^IL) ^F> 

*>° {«GR d :||«|| oo =i}{ ceK<l ' ii>0: ll t '- c ll«» =ii ^ 1 } UJ \ t )A-^] d 2 

so if we write Vq := (1,1,..., 1), then 

(3-1-4) l|M|| 0pM >sup inf — / u(t\\c + Ru\U^. 

We claim that the infimum in (13.1 .4p is attained when c = (1 — R, 1 — R, . . . , 1 — R) for some 
R G (0, 1] (which may vary with the dimension). Let c = (ci, c 2 , . . . , q) satisfy \\v — c||oo = 
R < 1. Then c\, c%, . . . ,Cd > 1 — R. Suppose c\ > 1 — R (else, do nothing and consider c 2 
instead). Translate c to c 1 := (1 — R, C2, ■ ■ ■ , Cd), by moving it parallel to t\. Now, for every 
x G B(c, R)\B(c 1 , R), \xi\ > 1, while for every x G B^c 1 , R)\B(c, R), < 1. Since parallel 
transport in the direction of e\ does not change any of the other coordinates, the average 
value of || - 1 1 oo is not increased. Then repeat this process with each coordinate. 
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Next, note that the infimum is attained when 1/2 < R < 1. This immediately follows from 
the fact that for each u G B(0, 1), the function f(R) := ||(1 — R)v + Ru\\ oo is decreasing on 
(0, 1/2], since each coordinate function |1 — R + Rui\ = 1 — (1 — Uj)R is decreasing there. 
Thus 

(3.i.5) mwzw^^J^mi-sfr+to-u?- 

On the other hand, using (I3.1.3P and, for each each unit vector v, taking the infimum over a 
smaller set of associated centers c, we get 

1 F d 
(3-1-6) \\M\\ om < sup sup inf ~T77T / w(* ||(1 - + J2u|L) 

i>0 | 1 , gRd .|| t ,|| oo=1 } {1/2<,R<1} W (tj 7[_l,l]d i a 

We show that the supremum over unit vectors is attained on Vq, so in fact the right hand sides 
of (13.1.5P and (I3.1.6P are equal. From this, (I3.1.ip follows by making the change of variable 
x = (1—R)vq+Ru, and relabeling s = 2R — 1. Fix R G [1/2, 1], and let v = (vi, . . . ,Vd) satisfy 
||v||oo = 1- By symmetry considerations we may assume that v±, . . . , Vd > 0. Now we argue as 
before. If v\ = 1 do nothing and move to v 2 - Else, vi < 1, so shift v to v 1 := (1, v 2, . . . , Vd) by 
parallel transport in the direction of e\. Then for every x G B((l — R)v, R) \ B((l — R)v l , R), 
{1-R)v!-R < x x < 0, so |ari| < \{l-R)v x -Rl while if x G B({1- Rjv 1 , R)\B((1-R)v, R), 



we have |xi| > (1 — R)v% + i?. Since the average value of increases after the shift and 
the other coordinates do not change, the average value of || - ||oo increases. Then repeat this 
process with each coordinate. 

To obtain (13.1. 2p from (13 . 1 . 1 p . break up the integral appearing in (I3.1.ip into the regions 
{IMloo < s} and {||a;||oo > s}, and then separate these into the sets where \xi\ = ||a;||oo, for 
i — 1, . . . , d. Note, for instance, that if we are working over {||a;||oo > s}n{xi = ||x||oo}, then, 
for a fixed value of X\, the coordinates x 2 , ■ ■ ■ , Xd of the associated vertical section E X1 satisfy 
—s<Xi< X\, so \E X1 \ = (s + Xi)^" 1 . A similar remark can be made about the integral over 
{Halloo < s}, so applying Fubini's Theorem we obtain (I3.1.2p . 

Next we show that the norm of M does not decrease when the dimension changes from d 
to d + 1 if we keep the same modulus u. For notational simplicity, we suppose that d — 1. 
The argument for arbitrary d is the same. Let Sd G [0, 1] be the minimizing value of s in 
dimension d. Then 

-—?——/ u(tm.ax{\x\, \y\})dxdy > / / u(t\x\)dxdy 

(1 + s 2 ) 2 J[- S2 ,ip (1 + s 2 ) 2 J- S2 J- S2 



Lu(t\x\)dx > / uj(t\x\)dx. 

l + s 2 y_ S2 1 + si J_ S1 

To finish, we show that as the dimension d — > 00, for a fixed modulus u> we have ||M||o p (w) — > 
1. This is simply a consequence of the fact that for every s G [0, 1], in high dimensions the 
measure of the cube [— s, l] d concentrates near the norm one vectors. More precisely, let X 
be a random vector, chosen uniformly from [— s, l] d . Then its coordinate functions X{ are 
independent random variables, uniformly distributed over [— s, 1]. By slight abuse of notation 
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we use P to denote both uniform probabilities on [— s, l] d and on [— s, 1]. Fix < e < 1, set 
t — 1, and choose 5 G (0,£:) so that u(l — 8)/ui{l) > 1 — e. Then, for any < s < 1 and all 
sufficiently high d, we have 

PdlXlU >1-S)> P(max{X 1 , ...,X d }>l-6) = l- U d P(X t <1-<J) 



Thus, if the dimension d is large enough (depending of u and e) it follows that 



□ 



Observe that in the above proof we did not need to establish how the optimal value Sd 
of s 6 [0, 1] behaves as d — > oo. Intuition suggests that since the measure of [— Sd, l] d 
concentrates near its border as d grows, in order to minimize the average value of the norm 
over this cube, the origin should be increasingly closer to the boundary of [—Sd, l] d - Or, in 
other words, Sd should approach as d — > oo. This intuition is, in fact, completely erroneous, 
as formula ( I3.2.2|) below shows (see also the proof of Corollary 13. 4ft : When d — > oo, the 
optimal Sd tends to 1, and thus the optimal averaging cube has diameter approaching 
2. Nevertheless, we shall show that in the euclidean case the above intuition is correct: As 
d — > oo the origin must indeed be increasingly closer to the boundary of the optimal ball, in 
order to minimize the average value of the norm, and the £2 diameter of the optimal averaging 
ball must approach 1 rather than 2. This helps to understand why the asymptotic behavior 
of ||M|| 0p (i) is so different in the and £ 2 cases. 

Next we specialize the preceding theorem to the Lipschitz and Holder functions, obtaining 
the following corollary. 

Corollary 3.2. Let a E (0, 1], and consider the space (M. d , \\ ■ \\oo)- Then, on Lip a 
(3.2.1) ||M|| ' ' (/ 



Op(o) 




2 d s a+d s d-l-j _ s d+a 



a + d ^ V 7 7 ol + j + 1 

j=0 v J / 




In particular, when a = 1, that is, for Lipschitz functions on (M. d , || • H^) we have 

d 1 f 2 d s d ~^ 1 

(3.2.2) ||M||o P (i) = — - - -, — 7 max 



d+l d+1 o<s<i [ (1 + s 



Proof. To obtain (I3.2.ip . use (I3.1.2P in Theorem 13.11 with u> (t) = t a , and integrate. As for 
( 13.2.21) . it does not seem to be easy to derive it from (13.2.11) by substituting a — 1. Instead, 
use (I3.1.2p again, evaluating the integral f^(u + s) d ~ 1 udu via the change of variable v = u + s. 
This yields f!3.2.2|) but with max < s <i. To further refine this expression and obtain max 0<s <i, 
we observe that the maximum is actually achieved at some interior point of the unit interval. 
This can be seen by writing gd(s) := s — 2 d s d+1 /(l + s) d for s G [0, 1], and noting that since 
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9d(0) = 9diX) = 0, and g" d < on (0, 1), the function is strictly concave there. Thus, it has a 
unique maximum, which must occur at some interior point. □ 

Remark 3.3. Specializing formula (I3.2.ip to d — 1 and d = 2 gives the following expressions. 
We mention that they can also be obtained easily and directly from (I3.1.ip . 
When d — 1, we find that for every / G Lip Q (lR) and every a G (0, 1], 

(3-3.1) ||M||op ( «) ' ' ! ■ Sl ~" 



mm , 

o<s<i [ (1 + a) (1 

Furthermore, this result is independent of the R norm, since d — 1. The fact that the unique 

minimum occurs in the interior of (0, 1) is shown, as above, by elementary calculus arguments: 

1 _|_ 

Fix a G (0, 1], and for s G [0, 1] write g a {s) := — ^ ^ — y Evaluating g a on and 1 

we see (by inspection) that it achieves its maximum value at these points. Since g" a > on 
(0, 1), the function is strictly convex there, and thus it has a unique minimum. 

It is clear from (13.3. ip that on the real line, ||M||o p ( a ) < (1 + at) -1 , which is the sharp 
bound in the local case, that is, for Holder functions on intervals (cf. Corollary 14.51 below). 
Thus, Holder constants are smaller on the line than on proper subintervals. Note also that as 
a —> 0, ||M||o p (a) — > 1, again by (13.3.11) . Convergence to 1 as a — > holds also in dimension 
2, by fl3X2|) below. 

When d = 2, formulas (13.2.11) and (I3.1.ip become 

(3.3.2) M \\ov(a) = mm 2 ^ 

" P( ) °<*<i V (« + l)(a + 2)(l + S ) 2 

Here calculus arguments regarding extrema are more involved, and in fact, as d grows the 
formulas given by (13.2.11) become less manageable, though of course, numerical estimation is 
possible. However, in the simpler Lipschitz case, where a = 1, there is still a good deal of 
explicit information that can be extracted from (I3.2.2p . Contrary to our usual notation, in 
the next corollary we shall indicate the dependency of the maximal operator on the dimension 
d by writing 

Corollary 3.4. On Lip(R d , || • H^), ||M d ||o P (i) = (d-l)/(d+l)+o(l/(d+l)). More precisely, 

1 



d 1 1 1 n II d If 1 

(3.4.1) — < HM.Hopa, < — - — (l - 



1 



2y/d- 1 



Proof. The lower bound follows from (I3.2.2P by noticing that <?<f(s) := s — 2 d s d+1 /(l + s) d < 
s < 1 for all s G (0, 1]. To get an upper bound it is enough to take (I3.2.2P and assign any 
value from [0, 1] to s. For instance, the choices s = and s = 1 recover the general bound 
d/(d + l) from Corollary EUJ (so ||M d ||o P (i) = (d - l)/(d + 1) + 0(l/(d + 1))), but of course 
one can do better: The right hand side of (13.4. If) is obtained by taking s — 1 — 1/ \fd. □ 

While sufficient to prove asymptotic equivalence, the choice s = 1 — \j\fd~ made above 
is somewhat arbitrary and can easily be improved, at the cost of getting more complicated 
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upper bounds (so the second inequality in ( 13.4. ip is also strict). The optimal choice is the 
unique solution to the polynomial equation given next. 

Lemma 3.5. The norm of the maximal operator M on Lip(IR d , ||-||oo) is obtained by evaluating 
(3.5.1) Ma):= * __L.( a _ - 



d+1 d+1 \ (1 + s) 
on the unique solution Sd inside (0, 1) of the polynomial equation 
(3.5.2) pd(s) := 2 d s d+1 - (1 + s) d+1 + 2 d {d + l)s d = 0. 

Proof. Formula (13.5. ip follows immediately from (I3.2.2p . Recall from the proof of Corollary 
13.21 that the function ga(s) := s — 2 d s d+1 /(l + s) d has a unique maximum on (0, 1), to be 
found by solving g' d = 0, or equivalently, Pd(s) := (1 + s) d+1 — 2 d [d + l)s d — 2 d s d+1 = 0, on 
the said interval. □ 

When d < 3, degp d < 4, so its roots can be found explicitly using Cardan's formula. We 
do this next, thereby obtaining the sharp constant in dimension 1, and for the norm, the 
sharp constants in dimensions 2 and 3. Details are included for the readers convenience. 

Corollary 3.6. On Lip(R) we have ||M||o P (i) = V2 - 1. 

Proof. Solving Pl (s) = l-2s-2s 2 = on (0, 1), we find that /ii(si) = = y/2-1. □ 

Corollary 3.7. On Lip(R 2 , || • W^) we have 

(3.7.1) ||M|| 0p(1) = -L cos (|) + ^ S e C (|) -isec^(l) -3. 

Proof. We use Lemma 1331 finding first the unique root of s 3 +3s 2 — s— 1/3 in (0, 1). The change 
of variable s ^ s-1 leads to the reduced form s 3 -4s+8/3. Since (8/3) 2 -4 4 /3 3 = -64/27 < 0, 
this is the irreducible case in Cardan's formula. Following Viete we write s = (4/v3) y, to 
obtain Ay 3 — 3y = —y/?>/2 = cos (5tt/6 + 2nk), and now we use the trigonometrical identity 
4 cos 3 (0) - 3 cos (0) = cos (30) to conclude that s = (A/y/3) cos (5tt/18 + 2A;tt/3), k = 0, 1, 2. 
Of these three roots only the one corresponding to k = belongs to (1, 2), so S2 = cos || — 
1 G (0, 1). Finally, evaluating (cf. ( 13.5. ip ) on S2 and simplifying once more we find that 

h 2 (s 2 ) ^-cos^-j+^sec^-j-j sec J - 3. 

□ 

Remark 3.8. Therefore ||M||o P (i) w 0.574 on Lip(M 2 , || • H^). 

Remark 3.9. In dimension two, the results f !3.3.2j) and (I3.7.ip hold verbatim if we use the 
£% norm instead of the norm, since in dimension two i\ balls are just rotated cubes. In 
fact, this phenomenon repeats itself in every dimension, despite the fact that the geometry is 
very different when d > 3. Thus, the next result holds also for the i\ norm. 
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Corollary 3.10. On Lip(M 3 , || • Hoc) we have 

2 9/4 



(3.10.1) ||M|| 0p(1 ) = 1 - (( ^ )1/3+( ^ )1/3)3/2 i/on3 . 



1+ ( *!± i 

1 ((v / 8+v / 7) 1 /3 + (^/8-V7)i/3)3/2 



1/2 



Proof. By Lemma I3.5[ it is enough to find the the unique root S3 of p 3 (s) = 7s 4 + 28s 3 — 
6s 2 — 4s — 1 = in (0, 1) and then evaluate h 3 (s 3 ) (see (I3.5.ip ). Using the change of variable 
s = l/(2t — 1), we note that < s < 1 if and only if t > 1, and ^3(5) = if and only if 
2t 4 — 8t + 3 = 0. Now it can be checked by direct substitution that 

((V8 + v/7) 1/3 + (V8 - V7)^) 1/2 ( ( 2 9 / 4 ^ 1/2N 



h 23/4 {^ + ^7)1/3 + (y8_y7)l/3)3/2 1 

satisfies 2t 4 — 8t + 3 = 0, and furthermore, it is the unique t > 1 with this property, by the 
uniqueness of s 3 in (0, 1). Rather than substituting the value of s 3 directly in h 3 , it is more 
convenient to simplify ^3(53) first. Note that 

. , , 3 + 8s 3 + 6s 2 + 7s 4 

»•<«■) = 4(1 + .,)' ' 
and also 7sg = — 28s| + 6S3 + 4s3 + 1, since ^3(53) = 0. Eliminating the fourth order term 
and simplifying we get 



||M||o P (i) = h 3 {s 3 ) = 1 



«3 



1 + S 3, 

Using S3 = l/(2t 3 — 1), the preceding equality becomes ||M||o P (i) = 1 — t^ 3 , and now (13. 10.11) 
follows by substituting in the numerical value of £3. □ 

Remark 3.11. \\M || 0p (i) « 0.66155 on Lip(M 3 , || • H^). 

Next we study the cases p = 2 and p = 1. Since in one dimension all the l v unit balls 
coincide with the interval [—1, 1], the case d — 1 is covered by Theorem 13. II 

Lemma 3.12. Let d > 2, let f : [0, 00) — >■ [0, 00) be an increasing function, and let 1 < p < 
00. Then, for all R > 0, all c = (ci, . . . , q) G R d , and all a 6R such that \a\ < \c\\, we have 



f(\\x\\*)dx< / f{\\x\\ p p )dx. 

B(aei,R) J B(c,R) 

Proof Write c = (02, . . . , q) and x = (x%, . . . , Xd)- Using Fubini's Theorem and Anderson's 
Theorem (cf. Remark 12.81) in d and d — 1 dimensions, we get 

/(|M|;>/.r [ Cl+R ( [ f(\x 1 \ p +\\x\\ p p )dx)dx 1 > 



I 



B(c,R) Jci-R \J B(6,(-Rp-|ci-xi|p)Vp) 

Cl+R ( [ /(N P + \m P P )dx) dx x = [ f(\\x\\ p p )dx > 

d-R \J BiO^RP-lc-L-x^py/p) / JB(c iei ,R) 
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/ f(\Ml)dx. 

JB{aei,R) 



□ 



To apply (I2.14.ip . it is useful to determine on which unit vectors the supremum is attained. 
We have seen that one such vector in the case is (1, . . . , 1). For other p norms, if e\ can 
be selected, this usually leads to simplification of the formulas. The next lemma reduces the 
question of the optimality of e\ to the two dimensional case. 

Lemma 3.13. Let f : [0, oo) — > [0, oo) be an increasing function, and let 1 < p < oo. If for 
all c6R 2 and all R £ (0, 1] we have 

f(\\x\\l)dx< [ f(\\x\\ p p )dx, 

B(c,R) JB(\\c\\ p ei,R) 

then for all d>2, all c £ M. d , and all R £ (0, 1], 

/ f(\\x\\ p p )dx < / f(\\x\\*)dx. 

JB(c,R) J B(\\c\\ p ei,R) 

Proof. Assume the result is true for d > 2. Let c £ and let R > 0. We write c = (c, b), 
where c £ R. d , x = (x,y), x £ M. d , and x = (xi,x),x = (x%, . . . ,Xd) £ IR ' -1 . From Fubini's 
Theorem, induction, and the assumption for d = 2, we get 

rb+R [■ 

f(\\x\\ p p )dx= / / f(\\x\\ p p +\y\ p )dxdy 

'B(c,R) Jb-R J\x 1 -c 1 \P+...+\x d -c d \P<RP-\y-b\P 

rb+R /• 

/ f(\\x\\l + \y\ p )dxdy 

b-R J\x 1 -\\c\\ p \P+\x 2 \P+...+\x d \P<RP-\y-b\P 

f(\\x\\ p v +\x 1 \P+\y\ p )dx l dy}dx 



< 



\\x\\l<RP KJ^-WcW^PMy-b^KRP-WxWl 



< I / f(\\x\\ p p +\x 1 \ p + \y\P)dx 1 dy\dx 

\\x\\f,<RP \ J\x 1 -\\c\\ p \P+\y\P<RP-\\x\\ P ' 



I f(\\n p P )dx. 

J B(\\c\\ p e u R) 



□ 



For euclidean balls, optimality of e±, or any other vector (in every dimension), follows from 
symmetry. Now let d = 2. For l\ balls, optimality of e\ follows from the optimality of (1, 1) 
for balls, since each unit ball and its corresponding norm can be obtained from the other 
via a rotation and a dilation. And in this case e% is strictly better than nearby vectors. We 
have not been able to prove in a direct way the optimality of e\ for other £ p balls, 1 < p < 2, 
even for explicit moduli (approximation arguments seem to yield very limited results). 
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It is well known, and we use it below in the cases p = 1,2, that when 1 < p < oo the measure 
of the unit ball concentrates near the vectors with norm one and first coordinate equal to zero, 
that is, near the "vertical equator" perpendicular to e\. {x G M d : x% = and \\x\\ p = 1}. 
Indeed, the sections of the unit ball perpendicular to e\ are balls in and a small decrease 
in the radius r causes a large decrease in mass whenever d is high, since Lebesgue measure in 
scales like r^" -1 . It follows that the measure of the unit ball concentrates on the sections 
of maximal radius, i.e., when xi ~ 0. Likewise, the measure of the unit ball concentrates near 
the unit sphere. Thus we have concentration near the vertical equator, since the intersection 
of two very large subsets of the unit ball must be large. Of course, when p = oo this 
concentration near the vertical equator does not takes place. 

Let be the £ p unit ball {x e M d : \\x\\ < 1} (for convenience here we take balls to be 
closed), let §p _1 be the corresponding unit sphere, given by {x G R d : ||x|| = 1}, and let |Bp| 
and |Sp _1 | be their respective d and d — 1 volumes. When considering radii r not necessarily 
equal to one, we write Bp(r) and §p _1 (r). 

Theorem 3.14. Let d > 2, let M be the uncentered maximal operator associated to balls 
defined by the £2 norm, i.e., to euclidean balls, and let u be a modulus of continuity. Then 

n „ , (d - 1)T(1 + d/2) 

(3.14.1) M L^, = rap fof 



,1 r (l-«?)V2 

J J u(t{{l-R + R Ul ) 2 + R 2 p 2 ) 1/2 )p d ~ 2 dpd Ul . 

Furthermore, if we select the same modulus u in all dimensions, we have 



i/2<fl<i u (t) v / 7rr(l/2 + d/2) 




(3.14.2) sup <; } < hminf ||M|| 0p(u 
and 

(3.14.3) limsup||M|| 0pH <infsup 

Under the additional assumption that u is concave, the limit exists, it is equal to the left hand 
side of ^3.14-2 ), and bounds \\M\\ 0p ,^ uniformly in d: Given d > 1, 




uj ( 2~h 



(3.14.4) ||M|| 0pM < sup { = lim ||M|| 0pM . 



t>0 



In particular, for the Holder and Lipschitz classes we obtain 

\\M\\ 0p(a) < 2-f for all d>l, and Jim ||M|| 0p(a) = 2"f . 
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In order to give an idea about the size of the constant term (for d fixed) in (13. 14.1ft . we 
point out that 

(3 14 5) ( d \ /2 < T{1 + d/2) -( d+1 Y 2 

{6AA - b) \2j ~ r(l/2 + d/2) -{ 2 J ■ 

This is an easy consequence of the log-convexity of the T function (cf. Exercise 5, pg. 216 
of [Web]). Note also that if instead of taking the infimum in the right hand side of (I3.14.ip 
we just set R — 1, we are averaging over the whole unit ball, that is, we are acting as in 
the proof of Corollary 12.131 And indeed, if we change to polar coordinates and integrate we 
recover the bound dj (d + 1). So here, a better choice of R leads to lower asymptotic bounds. 
We shall see that in fact, when d — > oo the optimal R = R(d) approaches 1/2, as intuition 
suggests. The s-dimensional Hausdorff measure on R d is denoted by % s . 

Proof. As in the case of Theorem l3.ll it is enough to consider balls contained in Bg. To see this, 
fix any unit vector v, and let B(c,R) be a minimizing ball for v in (I2.14p . Suppose B(c,R) 
has points outside M 2 . Translating B(c, R) towards the origin along the ray {tc : t > 0} 
determined by the vector c, so that the displaced ball B(c',R) is fully contained in M 2 and 
tangent to the unit sphere, leads to 



/ lu (t \\c + Ru\\ 2 ) du < / Lu(t\\c + Ru\ 
Jb(o,i) ' •Ab(o,i) 



njj/ du 

?(o,i) Jb(o,i) 

by Anderson's Theorem (see Remark 12. 8p . It may well happen that after the translation 
v ^ B(c', R). If so, rotate B(c! ', R) about the origin to make its new center lie in the segment 
[0,v]. Since this does not change the value of the integral, we conclude that it suffices to 
consider balls contained in ~B 2 . Again by rotational symmetry we may take v to be ei, so it 
is enough to consider centers c = (1 — R)e\ and radii R, with < R < 1. Hence, by (I2.14.ip 
we have 

1 f , ,, „s „ ,, s du 



(3.14.6) ||M|| 0pM = sup inf -L / u ( t ||(1 - R)e x + Ru U2 , 

t>0 0</?<l U{t)J V d 



"2\ 



Note that the infimum in (13.14.61) is attained when R G [1/2, 1]. In fact, we claim that the 
function /^(-R) := ||(1 — R)g\ + -Rw|| 2 is decreasing when R < 1/2, so the minimum must 
indeed occur on 1/2 < R < 1. Differentiating f 2 (R) = (1 - R + R Ul ) 2 + R 2 J^Li u l ~ r2u I 
and using J2t=i u 1 — 1 together with R < 1/2, the claim follows. 

Given u G B^, we write u = (ux,y), where u\ G R and y G ]R d_1 . Denote by Pd the uniform 
probability on Bg, so c/P^m) = du/|B||. To prove (13.14.31) and fj3.14.2p . fix t > 0, and note 
that by concentration of measure near the vertical equator, for each 5 > we have 

(3.14.7) lim P d ({\ Ul \ <5,l~5< \\y\\ 2 < 1}) = 1. 

d— >oo 

Of course, the weaker assertion 

(3.14.8) lim P d ({\ui\ < 5}) = 1 

cf— >oo 
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also holds. Since ||(1 - R)e 1 + Ru\\ 2 < 1 for all ueB d 2 and all R £ [1/2, 1], 

(3.14.9) M f " ( " l(1 -^ 1 + J? " y ^M < PMM > I}). 

Next, note that on B2 D {|wi| < 8}, for every d and every R £ [1/2, 1] we have 

(3.14.10) \\{l-R)e 1 + Ru\\ 2 < ((1 - R + 5f + R 2 ) 1 ' 2 . 

The unique minimum of h\{R) := (1 — R + 8) 2 + R 2 on [1/2, 1] is attained at Rg — (1 + S)/2, 
and there = (1 + 5) 2 /2. Thus, 

(3.14.11) ||(1 - Rs) ei + R s u\\ 2 < 2-^(1 + 5), 

so splitting M 2 into the regions where |wi| < 5 and \ui\ > 8 we obtain 
(3.14.12) 

l/2</?<lJ B d U(t) U(t) 

Taking on both sides of the preceding inequality first sup t>0 , second, limsup^^, and third, 
inf 5>0 , from (13. 14.61) (with R £ [1/2, 1]) we get 

u f2-i(l + 8)t) 

(3.14.13) limsup ||M|| 0pH < infsup V '-. 

(2-s.oo 5 >° t>0 UJ [t) 

This proves (I3.14.3p . 

The argument used to obtain (13.14.21) is similar. Fix u = (ui,y) £ R d and note that on R 
the function 

h 2 (R) :=(1-R + R Ul ) 2 + R 2 \\y\\ 2 2 = ||(1 - fl) ei + Ruf 2 

achieves its unique minimum at R u = (1 — ui)/[(l — u\) 2 + ||y|||], where it takes the value 
h 2 {Ru) = lblll/[(l-«i) 2 +|b|||]. Thus, for < 5 < 1 and M £ M d 2 n{\ Ul \ < 5, 1-5 < \\y\\ 2 < 1} 
we have 

- 1 ~ 6 < - bh < ||(1 - R)e x + RuU 

from which it follows that 

a;(t((l + 5) 2 + l)- 1/2 (l-5)) 

(3.14.14) su P P d ({K| <5,l-8< \\y\\ 2 <l})-± 77\ L 

t>o oj\t) 

(3.14.15) < sup mf / "MV-Xgi + MJ ipM = m . 

t>o i/2<-R<i ,/ B d w(tj p ^ ; 
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Now ( 13.14.14p increases with d, so taking the limit inferior in ( 13. 14. 14ft and ( 13. 14. 15ft as 

d — > oo, we get 



w (t (a + 5) 2 + ir 1/2 (i - 5)) 



(3.14.16) snp < liminf ||M|| } . 

Likewise, the left hand side of the preceding inequality is decreasing in 5 G (0, 1), so (13.14.2j) 
follows by taking the supremum over 5, interchanging it with the supremum over t, and letting 
510. 

Next, write u = (ui,y) G M. d and y = pr], where p = \\y\\2 and \\r}\\2 = 1. By Fubini's 
Theorem 
(3.14.17) 

/ u(t\\(l-R)e 1 + Ru\\ 2 )du= / / u (t ||(1 - R)e x + R(m, y)\\ 2 ) dydu v 

Using polar coordinates on the vertical sections, or equivalently, by the coarea formula, we 
get 



/ 



oo(t\\(l-R)e 1 + R(u l ,y)\\ 2 )dy 

Ha-u?) 1 /*) 



= [ u ((1(1 - it!) + R Ul \ 2 + R 2 \\y\\l) 1/2 ) dU d -\y)dp 

Jo J§t~ 2 {p) 

r (l-u?)V2 

= {d- l))!^ 1 ! / u ((|(1 -R) + R Ul \ 2 + R 2 p 2 f 2 ) p d ~ 2 dp. 

Jo 

Since [IB2 1 = r(i+rf/2) > Q3.14.lj) follows from (I3.14.6p and the preceding equalities. 

Suppose next that u is concave. We show that the inequality in (13.14.4j) holds for every 
d > 1. When d = 1 the result follows from (I3.1.ip and the concavity of u: 

(3.14.18) ||M|| (u) < sup [ u (tx) dx < sup -^—u (t [ xdx] = sup ^I2±. 

Let d > 1 and let t > be fixed. If u (z) is concave, then so is to (tz 1//2 ) . Thus, given 
x > y > 0, we have to ((x — y) 1 ^ 2 ) + to ((2 + y) 1 ^ 2 ) < 2u (x 1 / 2 ). We write the integral 
appearing in (13.14.11) . together with its constant term, as 

I = j J u(t({l-R) 2 + R 2 {u 2 1 + p 2 ) + 2R(l-R)u 1 ) 1/2 )dp 



where 

' 'j 2 I //-2 
■ 2 



1)B^ 1 | d _ 2 , , 
dp = ^ — *-p d 2 dpdui 
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defines a probability on B 2 , R {p > 0}. Actually we get a probability fid for each dimension 
d, but since this is not relevant in the following argument we omit the reference to d in the 
notation. Now from concavity and the fact that u\ + p 2 < 1 we get 

I < 2 t Ud ' oj (t((l - Rf + R 2 (uj + p 2 )) 1 / 2 ) dfi < cu (t((l - i?) 2 + i? 2 ) 1 / 2 ) . 



JO 



Using infi /2 <H<i w (t((l - R) 2 + R 2 ) 1 / 2 ) = u (2~**) we obtain (I3.14.4j) . 

Regarding the assertion about the Holder and Lipschitz classes, for d > 1 the result follows 
from fl3.14.4p with u(t) = t a . And for d — 1, the upper bound is immediate from fl3.14.18p . □ 

Remark 3.15. Consider the Lipschitz functions on M. d . By the preceding theorem, for every 
d we have ||M||o P (i) < 2" 1 / 2 on Lip(M d , || • || 2 ). However, on Lip(M d , || • H^), ||M||o P (i) > 
(d— l)/(d+ 1), by Corollary 13.41 Since 2~ 1//2 < 5/7, any optimal constant in the case, with 
d > 6, is strictly larger than all the optimal constants in the Euclidean case (d — 1, 2, 3, . . . ). 
This illustrates the fact that using different norms on M. d may result in obtaining very different 
best constants. But the opposite can also happen: Best constants are identical for the l\ and 
£oo norms, as we shall prove by showing that the corresponding integral formulas for ||M||op(u>) 
are actually the same. 

Even though we are using different norms to define maximal operators and their associated 
moduli of continuity, we adopt the convention that the s-dimensional Hausdorff measure "H s 
is always defined via the Euclidean £ 2 metric, and normalized by the factor 7r s / 2 /T(l + s/2), 
so if s = d is a positive natural number, the cube of sidelength 1 has Hausdorff d measure 1. 
A consistent definition is required in order to use Fubini's Theorem, or more generally, the 
coarea formula. For example, by our convention the length of the polygonal curve E>\ is 
and not 8, which would be obtained if length were computed using the £i distance instead of 
the Euclidean distance. 

Theorem 3.16. Let d > 2, let M be the uncentered maximal operator associated to balls 
defined by the i\ norm, i.e., to cross-polytopes, and let uo be a modulus of continuity. Then 
(3.16.1) 

did — 1) f 1 r 1- !" 1 ! 

IMIopm = sup J^ R<1 I I " W ~ R ) + Ru A + Rp)) p d - 2 d P d Ul . 



-1 JO 



If we choose the same modulus uj in all dimensions, then we have lim^oo ||M|| 0p /^ = 1. 
Proof. Setting c = (1 - R)v in (j2.14.ip . we get 

ll M llopH<sup sup inf -— / a; (t ||(1 - R)v + Ru\\ x ) M 

*>° {veR d -. ||«|| 1= i} 0<jR<1 w (*)^i?(o,i) I-d(U,1J| 

Optimality of e\ in dimension 2, follows from the fact that in this case cross-polytopes are 
just rotated squares, and for d > 2, by Lemma 13.131 Thus 

(3.16.2) ||M|| 0pM < s^M ^ / „ (t ||(1 - R)e, + R Al ) 
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On the other hand, setting v = e\ in (I2.14.ip yields 

(3.16.3) ||M|| 0pH >sup inf — - / u> (t \\c + Ru\\ x ) — ^— . 

t>0 {ceR d ,l>R>0:\\e 1 -c\\ 1 =R} U{t)JB(0,l) |- y ( U ) i JI 

It follows from Lemma \3. 121 that, given t and R, the infimum is attained in the last inequality 
when c = (1 — R)e\. Hence 

1 f , ,, „s „ ,, s 



II^IIopH > su P n jnf -J- / w (f ||(1 - R)e 1 + RuWJ 

y ' t>0 0<R<1 U{t)J M d 



and thus we have equality. 

Next, write u = (ui,y) G M. d , and y = pr), where p = \\y\\i and \\r]\\i = 1. By Fubini's 
Theorem 
(3.16.4) 

/ u(t\\(l-R)e 1 + Ru\\ 1 )du= / u){t\\{l- R)e 1 +R{u 1 ,y)\\ 1 )dydu x . 

Next we handle the vertical sections. As a reminder, we refer the reader to |Fe] , pp. 248-250, 
or [EG], pp. 117-119, for basic information on the coarea formula 



(3.16.5) / g(y)\Jf(y)\dy= / / g(y)dH d -\y)dt. 

Let the Lipschitz function f(y) be ||y||i. By definition, \ Jf(y)\ = \/ det df(y)df(y) t , so for 
every y with no coordinate equal to zero, a computation shows that \ Jf(y)\ = \Jd — 1. Thus, 
this is the case on almost all IR d_1 . Set g(y) = l/\Jf(y)\ and use (I3.16.5p to obtain 

dy= f f - 7 ^=dn d -\y)dp = 



Jo Jsf- 2 ( P ) Vd-1 

;d-2| r l |sd-2| 
5 1 I / „d-2> l S l 



y/d^TJo (d-l) v / rf 3 T 
From this equality and the coarea formula (once more), with 

cu (t(\(l - R) + R Ul \ + RWyW,)) 



g(y) 



\Jf(y)\ 



we get 



/ u(t\\(l-R)e 1 + R(u 1 ,y)\\ l )dy 



u (t(|(l - i?) + + Rp)) KU, dp 
t 2 (p) ' " 



Vd^T 



Vd^l 



i-|m| 

w(t(|(l-i2) + ifoi| + i^)) / p d - 2 dn d - 2 (y)dp 
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rl-\ui\ 

= {d-l)\M d - 1 \ / u(t(\(l -R) + Ru 1 \ +Rp))p d ~ 2 dp. 
Jo 

It is well known (and easy to compute) that |B^| = so ■ = | and we obtain 

l|M|| 0pH =sup ini ' -±- -I / u(m-R) + Rui\+R P ))p d ~ 2 dpd Ul . 

To see that the infimum is attained when R 6 [1/2, 1], we note that on [0, 1/2] the function 
fi(R) = 1 — R + Rui + Rp has a negative derivative for every p e [0, 1 — \ui\). Finally, 1 
is a uniform upper bound by Kinnunnen's Theorem, and concentration of measure near the 
vertical equator {u G M. d : u\ = and ||w||i = 1} entails that lim^-s-oo H-^llopM = ^ 

Observe, for instance, that unlike the case of cubes and the norm, it is not clear from 
formula (I3.16.ip that ||^||o p ( w ) increases with the dimension. However, this must be the 
case, since given an arbitrary modulus, constants for the l\ and the norms are equal in 
each dimension. Of course this is trivial for d — 1, and clear for d = 2, since in this case 
cross-polytopes are rotated squares. But when d > 2 we have no justification to offer for this 
phenomenon, other than the proof below. 

Theorem 3.17. Fix d > 1 and select a modulus of continuity u. Then ||M|| , ^ has ex- 
actly the same value regardless of whether distances and the maximal operator are computed 
according to the l\ norm, or to the norm. 

Recall that in the case of the 1^ norm the asymptotic value 1 was obtained by using 
concentration near the boundary of the cube [— s, l] d , and more precisely, near the norm 
1 vectors, while in the i\ case we utilized concentration near the vertical equator. By the 
preceding theorem one of these (different) arguments is redundant. 

Proof. Fix t > 0. To see that the values of HMH^^ given by ( 13 . 1 . 1 j) and ( 13.16. lj) are indeed 
the same, it is enough to show that 

inf "I" I / / u{t{\{l-R) + Ru l \+Rp))p d - 2 dpdu 1 

1/2<R<1 2 J_ 1 J Q 



d 



(3.17.1) = Q mf i --—- d { 2 a I z^u (tz) dz + / (z + s) d - l u{tz)dz 



i 



Write 



where 



J u (t(|(l - R) + Rux\ + Rp)) p d - 2 dpd Ul = h + I 2 , 
h:= [ [ ' oo(t((l-R) + R(u 1 + p)))p d - 2 dpdu 1 



R— 1 

R 



24 J. M. Aldaz, L. Colzani, J. Perez Lazaro 

and 



-1+ux 



h'= J J u(t(R-l + R{p-u x ))) p d ~ 2 dpdu v 

To compute Ji, we use the change of variables v = 1 — R + R{ui + p), y = Ru\ and Fubini's 
Theorem: 

rR ri-\v\+u ( v + R _i^ y y-2 
h= / u(tv) — d dvdy 

JR-I Jl-R+y K 



1 r v+R- 1 

u(tv)(v + R-1- y) d ~ 2 dydv 



Jmax{i?-l,3^i} 



1 

~ w 

" " " ""—I-"' *■! 2 

Likewise, to compute J2 we set t> = i? — 1 + R(p — Ui), y = Rui, and interchange the order 
of integration: 

-fl-l r 2R-l / 1 1 D , „.\d-2 1 /-2.R-1 



N . v + 1 - R + y) d ~ 2 , , 1 r"- 1 , s , l7 

j R -i- y u ^ # dvdy = md^) L u{tv)v ~ 



Adding up we obtain 



1 ( f 1 AV + 2R-1Y- 1 , . „ r 2 *- 1 . ^ 



u(tv)- ' dv + 2 Lo(tv)v d ^dv . 



To finish, set s = 2R — 1, multiply by d(d — l)/2, and take the corresponding infima to get 

We conclude this section with four questions and some variants of these. We conjecture 
that the following version of Theorems 13. 141 and 13. 161 holds for all p G (1,2) and d > 2: For 
every modulus of continuity u we have 

(d-l)T(l + d/p) 

" t >o o<r<i 2u (t) r(i + i/p)r(i + (d- i)/p) 

pi y(l-[tti n VP 

y y w (t(i(i -i?) + j rm 1 i p +/?v) 1/p )p^ 2 ^i- 

Let q — p/(p — 1) be the conjugate exponent of p. Choose the same modulus of continuity ui 
in every dimension d. Then 



(3-17.2) ||M|| 0pH = sup Jnf , n il^T 7 ^ TTTZT x 



liminf ||M|| 0pH > sup 



w 2~~H 



and 



limsup ||M|| ( ) < inf sup 



cu ( 2 <j r£ 



>1 t> I U) (t) 
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Suppose additionally that u is concave. Then for every d > 1 



(3-17.3) ||M|| 0p{ <sup 




In particular, for the Holder and Lipschitz classes we obtain 



\\M\\ 0v(a) < 2"? for all d > 1, and lim ||M|| 



2"?. 



The main obstacle for proving this result would be removed by a positive answer to the 
next question. 

Question 1. Is e\ a maximizing vector for 1 < p < 2 when taking the supremum in 
fl2.14.ip ? Recall that by Lemma [3. 131 it is enough to consider the case d = 2. 

It is plausible that best bounds increase with the dimension for 1 < p < oo, as it happens in 
the case p = oo. This would immediately imply that the asymptotic bounds are also uniform 
upper bounds. 

As we mentioned before, it seems likely that v p := c/~ 1 / p (l, 1, . . . , 1) is an optimizing vector 
when 2 < p < oo. But while using e± leads to simplification of the integral formulas, using 
v p does not. Furthermore, e\ fits well with Fubini, in the sense that sections of <i-balls yield 
d — 1-balls if the sections are perpendicular to e±. This does not happen with v p , so a change 
of coordinates would not help on that respect. In any case, we suspect that on the range 
2 < p < oo, as p — > oo constants become increasingly worse and approach the bounds that 
hold for p = oo. 

Question 2. We have seen that if / is Lipschitz and M is the maximal function associated 
to euclidean balls, then for every dimension d, Lip(M/) < 2~ 1 / 2 Lip(/), or equivalently, 
ll-DM/Hoo < 2~ 1/ ' 2 ||D/|| 00 . It is natural to expect a similar behavior for p "close" to oo. More 
precisely, given c G (2~ 1//2 , 1), is it possible to find a p c such that for all p > p c , if Df G L p , 
then \\DMf\\ p < c\\Df\\ p ? Or given p » 1, is it possible to find such a c G (2" 1 / 2 , 1)? 

Current methods of proof use the L p inequalities satisfied by M to obtain W 1,p results. 
Since ||M/|| P > ||/|| p always, this type of argument will never yield constants below 1. On 
the other hand, if we fix d, by Corollary we have H-DM/H^ < (1 + d)~ l d\\D fW^ for 
the maximal function associated to any ball, so it is natural to seek inequalities of the form 
|| DMf ||p < Cp || .D/ ||p, where c p < 1, p is high enough, and M is defined via an arbitrary 
norm. At the other extreme, given / G W /1,1 (lR d ) with d > 2, it is not known whether 
there is a constant C\ (independent of /) such that ||DM/||i < ci||/|| w i,i(- R d). We mention 
that for some related maximal operators, such as, for instance, the strong maximal operator 
(where averages are taken over rectangles with sides parallel to the axes) such constants do 
not exist, cf. Theorem 2.21 of [AlPe2] . It follows from Theorem 2.5 of [AlPeJ that if d = 1 
then ||Z)M/||i < ci||£)/||i, and C\ = 1 is sharp. For d > 2 it is clear that if any such constant 
exists, it must be strictly larger that 1 (take / to be radial, for instance). Whether or not 
Ci < oo for d > 2, one would expect c p > 1 for small values of p, c p < 1 for sufficiently large 
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values of p, and c p = 1 for some "crossing" p, which will likely depend on d and the ball used 
to define M. A motivation to obtain detailed information on DMf comes not only from the 
possible use of DMf as a substitute for Df, but also because it will yield new information 
about the maximal function itself. 

On a more speculative mood, we note that where balls of arbitrarily small radii have to 
be taken into account, the function is "large" and the maximal function coincides with it, 
so maximal functions with smaller Lipschitz norms, and hence lower rate of decay "from the 
top", will tend to be larger in an L p sense. Since asymptotically constants are smaller for 
£ 2 balls than for cubes or t\ balls (and we believe this is also the case for other balls), it is 
tempting to conjecture that the maximal function associated to Euclidean balls is at least 
"as efficient" at capturing mass as maximal functions associated to other balls. Our results 
suggest this only in the weakest possible sense, since different norms are used to compute 
distances, the directions we consider when measuring the modulus of continuity are those of 
fastest decay and not some "average direction" , and L p norms of extremal functions vary with 
p (at least in the bounded case). Nevertheless, it seems worthwhile to try to find out whether 
the Euclidean maximal function, both in the centered and uncentered versions, has larger 
operator norm on L p spaces than maximal functions associated to other balls (needless to 
say, weak type results in this line would also be interesting). In fact, the weaker "comparison 
theorem" suggested next would already have many consequences. 

Question 3. Let Ai e denote the maximal operator, either centered or uncentered, as- 
sociated to Euclidean balls, and A4b the corresponding (centered or uncentered) operator 
associated to some other ball (defined by a different norm). Prove or refute the following 
statement: For every p G (1, 00), 

Of course, 10 78 is not important here, any other constant c would do. But we do mean to 
emphasize that 10 78 does not depend on anything, in particular not on p or d. "Constants" 
that depend on the dimension are trivial to obtain by the equivalence of all norms in M. d . The 
preceding conjecture, if true in the centered case, would imply that the uniform bounds in the 
dimension proved by E. M. Stein for Euclidean balls also hold for all other balls (including 
for instance, cubes) and all p > 1. An opposing viewpoint can be found in |Muj . pg. 298, 
where it is suggested that this result may be false for cubes and p < 3/2. 

The corresponding version of question 3 for the weak type (1,1) constants is also interesting. 
An affirmative answer would entail that the best constants for the centered maximal function 
defined using euclidean balls diverge to infinity with the dimension, since this is the case 
for cubes, cf. |A12] . Thus, a long standing open problem by Stein and Stromberg would be 
solved, cf. |StStj . 

From the perspective of comparing the sizes of the maximal operators associated to eu- 
clidean balls and to cubes, it would probably be more telling if Lipschitz or Holder constants 
were computed using the same distance (for example, the euclidean norm) in both cases. 
Of course, this "decoupling" between cubes and the £ 2 norm means that we are outside the 
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scope of Theorem 12.71 However, the maximal operator associated to cubes fits well with the 
product structure of R d , so it is natural, and common, to use it together with the natural 
distance on R d , the euclidean length. The next question seems therefore interesting to us: 

Question 4. If in Theorem 13.11 we keep the maximal operator associated to cubes but 
consider the £2 instead of the norm, how do the conclusions change? 



Next we study the local case in one dimension, that is, when the domain is a proper 
subinterval of R. While the idea of the proof is the same as in the previous results, formally 
the next theorem does not follow from them, so we include the full argument. In fact, notation 
is considerably simplified by the fact that only intervals with x as one endpoint need to be 
considered when computing Mf(x). In order to define any such interval it is enough to specify 
the other extreme, there is no need to talk about centers, radii and the relations between 
them. 

We note that constants are worse in the local case than in the global case due, to the fact 
that a proper subinterval J of M has at least one boundary point, say, for instance a left 
endpoint a. Then the decay restrictions imposed by the modulus of continuity only hold to 
the right of a, and so the level sets of an extremal function will be in general smaller than 
they would be if the function were defined over the whole real line. This is reflected in the 
different integral formulas; recall that for R, Theorem 13.11 tells us that 



The existence of a boundary point entails that we must take s = in the local case, cf. (14.3. ip 
and (gZl below. 

Remark 4.1. If / is uniformly continuous on a bounded interval /, then its modulus of 
continuity is constant on [| J|, 00), with value u(f, Thus, if we are given a modulus u, 
and |/| < 00, we cannot expect to find a function ip G Lip(w,J) with u(ip,t) = u(t) for all 
t > 0. In general, the best we can do is to find 1/; so that co(ip,t) = u(t) on (0, |/|]. 

Remark 4.2. By an interval / we always mean a nondegenerate interval, so the empty 
set (a, a] and points [a, a] are excluded. Also, the requirement that subintervals be proper 
leaves out the already studied case / = R. Other than that, there are no restrictions on the 
subintervals of R considered in Theorem 14.31 Nevertheless, for convenience we shall assume 
in the proof that intervals are not open. The open case is handled in essentially the same 
way, via a limit argument. So / will contain at least one endpoint. To simplify notation, we 
shall assume that this endpoint is the origin, and furthermore, that it is the left endpoint of 
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Theorem 4.3. Let I C R be a proper subinterval, and let f : I — >• R 6e locally integrable. 
Then, for every t > 0, 

(4.3.1) u(Mf,t)< f u{\f\,tu)du. 

Jo 

Inequality ^.S.l ) is sharp in the sense that for every modulus of continuity uo and every 5 > 
with 5 < \I\, there exists a nonnegative, uniformly continuous function ip : I — y R such that 
for all t G (0, 5), we have to(ip, t) = uu(t) and 

(4.3.2) u{Mifj,t)= [ u(tu)du. 



Moreover, if uo is bounded, then the function ip can be chosen so that it satisfies u(ip,t) = uo(t) 
and for all t > with t < \I\. 

Proof. Assume / > 0, and note that when we evaluate Mf(x), taking the supremum over 
all intervals containing x yields the same value as taking the supremum over intervals having 
x as a boundary point (in other words, Mf is the maximum of the right and left one sided 
maximal functions). Let x,y G / be such that Mf(y) < Mf(x). Then 

M/(z)-Af/(y) = sup— !— / f(u)du-Mf{y) 
sei o — xj x 



sei 



1 



5 i r S 



< sup I — / f{u)du — — / f(u)du 



S — x J x S — y 



(4.3.3) <sup f 1 \f(x + (S - x)u) - f(y + (S - y)u)\du < I u{\f\ , \x - y\u)du. 

sei Jo Jo 

Now using a symmetry argument between x and y, and taking the supremum over \x — y\ < t, 
(I4XTD follows. 

We prove the optimality of (14.3.11) on the interval I = [0, oo). The argument can be easily 
adapted to other proper subintervals of R. If u is bounded, set ip := ||cj||l°°([o,oo)) — 
It is immediate from the definitions that u(ip,t) = u(t). Since ip is decreasing, Mip(t) = 
t^ 1 J* ip{u)du = Jq 1 ijj(tu)du, so 

(4.3.4) co (Mip, t) > Mip(0) - Mip(t) = ||w||loo ([0iOo)) - f ip(tu)du = [ u (tu) du. 

Jo Jo 

Thus, by f)4.3.ip we have equality. If u is unbounded, the result follows from the previous 
case by fixing 5 > and considering the bounded modulus us(t) := mm{u(t),u(5)}. □ 

Remark 4.4. In f)4.3.4p we are crucially using that t < Suppose / = [0,1], and let 
u(t) = u(if),t) = min{t, 1}, where ip(x) = 1 - x. Then for all t > 1, u(Mif>,t) = 1/2 < 
1 = lim^oo J 1 u (tu) du. For the same reason, the restriction t < \I\ is also needed in (14.5. ip 
below. 



Modulus of continuity of the maximal function 



20 



Recall that H/Hup^,/) — sa 'Pt > ow(f,t)/(jj(t). In the next result we use the following no- 
tation: ipT '■ [0, oo) — > [0, oo) is defined by ipT '■= (T — uj) + . We shall see that only the 
functions ipT need to be taken into account when computing ||M||o p (w). If u> is bounded we 
set T = ||w||i<»([o,oo)) ( as usual) and just write ip. 

Corollary 4.5. Let u be a modulus of continuity and let I be a proper subinterval o/R. On 
Lip(w, I), 

1 f 1 

(4.5.1) ||M||o p (a)) = sup — — / uj(tu)du. 

0<t<\I\ u[t) J 

In the special case where uo(t) = t a , a e (0, 1], that is, over the Holder and Lipschitz classes, 

(4.5.2) l|M||op ( a) ' 



1 + a 



Proof. If u is bounded, (14.5. ip follows immediately from the fact that there is an extremal 
function ip such that for all t > with t < \I\, u)(ip, t) = uo(t) and oo (Mtp, t) = L u (tu) du. 
And if |/| < oo, replacing uj by uj' := min{a;, a;(|/|)}, we are back to the bounded case. 

Suppose next that both uj and / are unbounded. Without loss of generality we may assume, 
in order to simplify notation, that I = [0, oo). Select for each n > 1 a norm one function 
f n G Lip(oj, J) approximating the operator norm of M to within 1/n. Then 



1 1 f 1 1 1 f 1 

||M||o p h < ||M/ n || Lip{ti;) + - < sup—— / u(\f n \,tu)du + - < sup—— / 

n t >o u{t)J n t>o u{t)J 



u (tu) du-\ — . 

n 



On the other hand, since each ipT has norm 1, 



||AfHop(w) > sup ||M^||Lip(w) = sup { sup [ u (ip T ,tu) du 

T>0 T>0 l0<t<T UJ{t) J Q 

= SUp { SUp — rrr / UJ (tu) du 1 = SUp — |— / UJ (tu) du. 
T>0 {0<t<T W{t) J J t>0U[t)J 

Ifu(t) = t a , ae (0,1], then 

-1 pi i 



1 

o<t<|/| u(t)J Jo l + a 



||-W||op(w) — sup — j-^ I uj (tu) du = I u a du 



□ 



Remark 4.6. The sharp bounds given above yield information about related inequalities. 
For instance, in Theorem 5.1 of |AlPe] the following Landau type inequality is proven: If 
u : [0, oo) — > R is an absolutely continuous function such that its derivative is of bounded 
variation, then 

(4.6.1) Wu'Wl < 48|MU (||DM( M /+ )|| 00 + UDMOOIloo) . 
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It is natural to seek a lower bound on the best possible constant c that can replace 48 in 
(14.6.11) . Recall that the classical (sharp) Landau inequality assumes more regularity on the 
part of u'\ If it is absolutely continuous on [0, oo), then 

(4-6.2) Wu'foo <4||m|| 00 || m // || 00 . 

As noted in Remark 5.5, of |AlPe] . (14.6.11) implies Landau's inequality, save for the issue of 
best constants. Using Corollary S3] and ( KITH we have that ||DM(u /+ )|| 00 + ||DM(u'-) < 
2 _1 ||£)tt' + || 00 + 2~~ 1 \\Du'~ Hoc < UDm'Hoo, so 4 < c < 48. On the whole real line, the constant 
from Theorem 5.1 of |AlPe] appearing in (14.6.11) is 24 instead of 48, and the best constant in 
the sharp Landau's inequality (14.6. 2ft is 2 instead of 4. Arguing as before and using Corollary 
EJ2 we have \\DM(u' + )\\ 00 + WDMfa'-)^ < 2(y/2 - l^Du'W^, so 1/(^2 -1) < c < 24. 
Thus, on R the best constant appearing in the generalized Landau inequality ( 14.6. ip is strictly 
larger than the best constant 2 in the classical Landau inequality. 

Remark 4.7. By Corollary 14.51 the functions ip a {x) := max{l — x a ,0} are extremal in the 
classes Lip(a)([0, oo)). Now Mip Q (x) = 1 — on < x < 1 (and Mip a (x) 



a: 



1 + a (1 + a)x 

on 1 < x < oo). Thus, the Holder or Lipschitz exponent of Mip a is no better than that of 
ip a . But in this example there is only one "bad point": For every e > 0, both ip a and Mip a 
are Lipschitz on [e, oo). In view of the regularizing properties of M, one may wonder, for 
instance, whether Mf must have a better Holder exponent than / save for small sets, or in 
some "almost everywhere" sense. We shall see below that the answer is negative. 

We also note that the preservation of regularity does not extend to the C r classes. To see 
this, let / be the sum of two smooth bump functions with disjoint supports and recall that 
Mf is the maximum of the right and left one sided maximal functions. This entails that 
between the bumps of /, Mf achieves its minimum value at a point of non-differentiability, 
since the right and left derivatives of Mf are nonzero there and have opposite signs. 

Next we show that a Holder condition on / is not sufficient to ensure the differentiability 
almost everywhere of Mf, so in particular Mf can fail to be absolutely continuous. The 
following example also shows that the Holder exponent of Mf may be as bad as that of / 
on a large set. To prove this we suitably modify the fat Cantor set defined in Example 4.2 of 
|AlPej . and the functions used there. 

Example 4.8. For every a G (0, 1) there exists a function f G Lip a ([0, 1]), with Lip Q (/) = 1, 
such that Mf is not differentiate on a set E of positive measure. Furthermore, given any 
j3 G (a, 1), Mf is not locally Holder (j3) at any point of E. More precisely, given any x G E, 
and any interval I , relatively open in [0, 1] and with x G I, we have Mf ^ LipJl). 

Proof: As in Example 4.2 of [AIPe], and unlike the usual construction of the Cantor set, 
instead of removing the "central part" of every interval at each stage, we remove several parts. 
Let F = [0, 1] and let F n be the finite union of closed subintervals of [0, 1] obtained at step 
n of the construction, to be described below. As usual C := n n F n . Denote by £(J) and r(J) 
the left and right endpoints of an interval J. At stage n we remove 2~ 2n of the mass in the 
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preceding set, so |C| > 1 — | Uf F£\ > 1 — 2 ~ 2n = 2/3. Let J n _i be a component of F n _i, 
and let 0(n,x) the open interval centered at x of length 2~ 4n \I n _ 1 \. Select the unique 2 2n 
points x±,x 2 . . . , x 2 2n G 7 n _! such that i) £(I n -i) < Xi < • • • < x 2 2n < r(/ n _i), and ii) after the 
removal of the 2 2n open intervals 0(n, Xj) from I n -i, the 2 2n + 1 remaining components have 
equal length, which therefore must be (1 — 2~ 2n )|/„_ 1 |/(2 2 ™ + 1). Thus, x 2 — X\ = x i+1 — X; t 
for % < 2 2n , while r(I n -i) — x 2 2n — x± — £(I n -i) < x 2 — x\. Note that the extremes of are 
left untouched. 

Repeat the same process with the other components of F n _i to obtain F n , which by con- 
struction is a disjoint union of closed intervals, all of length (1 — 2~ 2n )|/ n _ 1 |/(2 2n + 1). 

Next we define the function /. Given t > 0, set g t (x) — 1 — (t — x) a on [0, t) and extend it 
to (—t,t) as an even function. Then let / be identically 1 on C, and let f(x) := g t (x — s) on 
each interval of type 0(j, s), where t is half the length of 0(j, s). To see why / G Lip a ([0, 1]) 
and Lip a (/) = 1, note that if x G C and y G [0, 1] \ C, say, with y < x, then there is a 
unique 0(n,w) containing y, so we can replace x by r(0(n,w)) and then f(x) — f(y) = 
f(r(0(n,w))) — f(y), while r(0(n,w)) —y<x — y; the case where x and y belong to different 
components of [0, 1] \ C can be reduced to the previous one: Let y G 0(n, w), x G 0(m,w), 
and suppose y < x. If f(x) < f(y), replace y with £(0(m,w)); otherwise, replace x with 
r(0(n,u)). So it is enough to verify that / has the desired properties on the closure of an 
arbitrary 0(j, s), and this is true since it holds for \x\ a , its translates, reflections, etc. Next, 
let /„ := 1 on F n and f n := / on [0, 1] \ F n . Then / < /„, so Mf < Mf n , and of course 
the latter function is easier to work with. Let Xi and 0(n, Xi) be as above. For notational 
convenience, suppose we are taking % — 2; the same reasoning applies to the other Xj. To 
evaluate Mf n (x 2 ) we only need to consider intervals J that have x 2 as an endpoint and, we 
claim, omit both x\ and £3. To see this, assume that x 2 = £(J). Since f n (x 2 ) < Mf n (x 2 ) < 1, 
an optimal J must satisfy / n (r(J)) = Mf n (x 2 ). Otherwise we could improve the average by 
either shortening or enlarging J. Thus, r(J) G 0(n,x 3 ) fl {x < x 3 }. Of course, the situation 
is completely symmetrical if x 2 = r{J) (here £(J) G 0(n, X\) fl {x > x±}), so we keep the 
assumption x 2 = £(J). Next, define h > f n by setting h = 1 on 0(n,x 3 ) fl {x < x 3 } and 
h = f n elsewhere. Clearly 

M U*2) < ~ / h = n ,0-4n M T I 

X3 ~X 2 J X2 ( 2 2n + 1 + 2 ^ n )\l n -l\ 



2 (-4n-l)(Q+l) 

■^ + 2-*»)(a + l) 



1 _ I t la ^ 1 o— 4ara— 2n— a— 2 1 r la 

1 0-9^ — -Mn-1 <1 — Z • 



Fix z G C fl {M/ = 1}. For each n, let be the component of F n that contains z, and let 
u» n be midpoint of the nearest interval 0{n) deleted at step n (if there are two such midpoints, 
pick any). Then \z — w n \ < 2 _2n |/ n _i|, and furthermore \I n -i\ < 2~ n ( n ~ l \ since the number of 
components of the set F n _i is U™=i( 22i + x ) > U^=i 2 ^ = 2 n(n - 1 \ Thus, for every f3 G (a, 1] 
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we have 



\Mf(z)-Mf( W )\ ^ 1 — Mf n (w n ) 2 -4 a n-2n- a -2 

hmsup > hmsup > lim {n _ l)¥ _ a = oo. 

Since M/ > / a.e., the sets £ := Cfl {^f/ = 1} and C have the same measure, so taking 
/3 — 1 it follows that M/ is not differentiable on a set of measure at least 2/3, while for 
E (a, 1) we obtain the failure of the local Holder condition with exponent /3. □ 

Remark 4.9. The following example shows that local, higher dimensional generalizations 
of the preceding results may fail. Recall that in Definition 12.11 the balls in M. d we av- 
erage over, are assumed to be fully contained in the (open) domain U of /. Let U = 
{(x,y) : x E R d , y E R, y > exp (—1/ |x|)} be a domain with a cusp and let f(x,y) = y. 
Then lim^ Mf(0, y) = 0, while Mf(0, 1) = oo. 
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